An Efficient Stepwise Statistical Test to Identify Multiple Linked Human Genetic Variants Associated with Specific Phenotypic Traits

نویسندگان

  • Iksoo Huh
  • Min-Seok Kwon
  • Taesung Park
  • Zhongxue Chen
چکیده

Recent advances in genotyping methodologies have allowed genome-wide association studies (GWAS) to accurately identify genetic variants that associate with common or pathological complex traits. Although most GWAS have focused on associations with single genetic variants, joint identification of multiple genetic variants, and how they interact, is essential for understanding the genetic architecture of complex phenotypic traits. Here, we propose an efficient stepwise method based on the Cochran-Mantel-Haenszel test (for stratified categorical data) to identify causal joint multiple genetic variants in GWAS. This method combines the CMH statistic with a stepwise procedure to detect multiple genetic variants associated with specific categorical traits, using a series of associated I × J contingency tables and a null hypothesis of no phenotype association. Through a new stratification scheme based on the sum of minor allele count criteria, we make the method more feasible for GWAS data having sample sizes of several thousands. We also examine the properties of the proposed stepwise method via simulation studies, and show that the stepwise CMH test performs better than other existing methods (e.g., logistic regression and detection of associations by Markov blanket) for identifying multiple genetic variants. Finally, we apply the proposed approach to two genomic sequencing datasets to detect linked genetic variants associated with bipolar disorder and obesity, respectively.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using stepwise regression to identify ISSR molecular markers associated with agronomic traits in ispaghula (Plantago ovata Forssk.) ecotypesa ecotypes

In this study, the associations between ISSR markers with some agronomic traits in 22 ispaghula ecotypes were used by stepwise regression analysis. The results of stepwise regression analysis showed a significant association between traits and some of loci markers positions. For some traits was detected more than one informative marker. Totally 90 informative ISSR markers were revealed that due...

متن کامل

TATES: Efficient Multivariate Genotype-Phenotype Analysis for Genome-Wide Association Studies

To date, the genome-wide association study (GWAS) is the primary tool to identify genetic variants that cause phenotypic variation. As GWAS analyses are generally univariate in nature, multivariate phenotypic information is usually reduced to a single composite score. This practice often results in loss of statistical power to detect causal variants. Multivariate genotype-phenotype methods do e...

متن کامل

Detecting epistasis with the marginal epistasis test in genetic mapping studies of quantitative traits

Epistasis, commonly defined as the interaction between multiple genes, is an important genetic component underlying phenotypic variation. Many statistical methods have been developed to model and identify epistatic interactions between genetic variants. However, because of the large combinatorial search space of interactions, most epistasis mapping methods face enormous computational challenges...

متن کامل

Gene Based Association Approach Identify Genes Across Stress Traits in Fruit Flies

Identification of genes explaining variation in quantitative traits or genetic risk factors of human diseases requires both good phenotypicand genotypic data, but also efficient statistical methods. Genome-wide association studies may reveal association between phenotypic variation and variation at nucleotide level, thus potentially identify genetic variants. However, testing million of polymor...

متن کامل

A two-stage inter-rater approach for enrichment testing of variants associated with multiple traits

Shared genetic aetiology may explain the co-occurrence of diseases in individuals more often than expected by chance. On identifying associated variants shared between two traits, one objective is to determine whether such overlap may be explained by specific genomic characteristics (eg, functional annotation). In clinical studies, inter-rater agreement approaches assess concordance among exper...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2015